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Abstract 


Starting from the seminal work of Knuth and Bendix, we develop several no- 
tions useful in the study of term rewriting systems. In particular we introduce 
the notions of "derived pairs” and “overlap closure” and show that they are use- 
ful in analyzing sets of rewrite rules for various properties related to termi- 
nation. We also introduce a new ropresentation, based on rewrite dominoes, for 


rewrite rules and sequences of. rewrites. 


Keywords: Confluence, Critical pair, Derived pair, Knuth-Bendix, Overlap clo- 


sure, Restricted termination, Rewrite dominoes, Term rewriting systens, Uniform 


termination 


1. Work partially supported by the National Science Foundation 
under grant NMCS78-01798 and by an Office of Naval Research 
Contract with DARPA funding #NO0014—75-C-0661. 


1. I UCTION 


We introduce three new tools for the study of term rewriting systems. 
Derived pairs of a rewrite rule generalize the well-known idea of "critical 
pairs” introduced by Knuth and Bendix (1970) in their development of a method of 
proving the confluence property. The overlap closure of a set of rules is a set 
of rules that corresponds to a subset of the transitive closure of the rewriting 
relation. Its construction is based on the use of derived pairs obtained from 
superpositions of the right hand side of one rule with the left hand side of 
another. This process is closely related to the Knuth-Bendix process, which uses 
critical pairs for generating new rules in an attempt to achieve confiuence. We 
use the overlap closure in proving - or disproving — that a rewriting relation 
is uniformly terminating. It thus provides an interesting dual method to the 
Knuth-Bendix process, in which the validity of the critical pair test for con- 
fluence depends upon uniform termination. The combination of uniform termina— 
tion and confluence provides a decision procedure for the theory of the equa- 
tions corresponding to the original rules. 


In the study of derived pairs and overlap closures we found it useful to 
devise a new way of representing rewrite rules and sequences of rewrites using 
what we call rewrite dominoes and “rewrite domino layouts”. We will introduce 
this representation and use it in presenting the proofs of our main results 
about the overlap closure. We believe that this representation also will be 
useful in the study of other areas of rewrite rule theory. . 


Like the Knuth-Bendix process, the overlap closure process may fail to ter-— 
minate (that is, it may continue to generate new rules indefinitely). In fact, 
when the original rules are uniformly terminating, it will usually happen that 
overlap closure generation is nonterminating. In this case, the overlap closure 
process does not by itself yield a proof of uniform termination, but it may be 
useful as an aid in applying other known methods of proving uniform termination 
{see Huet and Oppen, 1980]. It can also be used in proving what we call “res- 
tricted termination,” i.e., termination for all terms up to a given size. Some 
applications of restricted termination are discussed in [Guttag, Kapur and 
Musser, 1981]. 


Perhaps more important is the case where the original rules are not uni- 
formly terminating. One would often like to be able to detect this situation 
quickly, e.g., in order to avoid wasting time attempting to construct a proof of 
uniform termination. We show that under some reasonable restrictions on the 
form of rewrite rules, the overlap closure construction provides such a test. 


1. more commonly called finitely terminating or noetherian. 


I.e., we show that if the rules are globally finite (that is to say, the number 
of different terms to which any term can be rewritten is finite) and every rule 
is right-linear or every rule is left-linear, the overlap closure construction 
can be used to effectively search for cycles in the rewriting relation. (That 
it does so "quickly" enough to be useful is a claim for which we have limited 
empirical evidence, as discussed in the Conclusion section). 


2. D TION 


For the most part we use standard definitions and terminology for term 
rewriting systems from Huet (1980) and Huet and Oppen (1980). There are a few 
exceptions, such as “uniform termination” for "finite termination,” and “terni- 
nal form” for "normal form.” In [Guttag, Kapur, and Musser, 1981], the reader 
will find a thorough discussion of this background material. Here we confine 
ourselves mainly to the definitions of “derived pairs,” a generalization of the 
Knuth and Bendix’s notion of "critical pairs,” and of "overlap closure.” 


Two terms are said to overlap if one is unifiable with a nonvariable sub- 
term of the other. If s and t overlap, we define their superposition: either 

a) s unifies with a nonvariable subterm t' of t, by the most general unif- 
ier (m.g.u.) 6, in which case 6(t) is called a superposition of s and t; or 

b) a nonvariable subterm s' of s unifies with t, by m.g.u. 6, in which case 
@(s) is called a superposition of s and t. 


Now consider ordered pairs of terms (r,s) and (t,u) such that s and t over- 
lap, as above. (If the variables of t must be renamed, the same renaming must 
be applied to u.) Then along with the superposition 0(t) or @(s) we obtain the 
derived pair of terms, <p,q>, where 

a) if s unifies with a nonvariable subterm t/i by m.g.u. 0, 


p=[0(t) with @(r) at i]? 
q=0(u); 
b) if a nonvariable subterm s/i unifies with t by m.g.u. 9, 
p=0(r) 
q=[@(s) with @(u) at i]. 


In the case of a rewriting system B = {1 - r,)}. the derived pairs obtained 


from the pairs (x51) and (1-73) are called critical pairs. 

1. The notation [t with u at i] stands for the term obtained 

from t by replacing the subterm at position i by s. A "sub- 

term position” and “corresponding subterm” within a term is a 

finite sequence of nonnegative integers separated by ".” and 

a related term determined as follows: to the asnll sequence 

(denoted <>) corresponds the entire teem. If f(t poverty ) is 

the subterm at position i then the subterm at position i. 2 is 

13 We write t/i for the subterm at position i within term 
t 


Consider, for example, obtaining a critical pair from the rewrite rules: 


re exc 


(x' ey’) @ z’->x' @ (y’ © z’) 


We begin by constructing the ordered pairs (ce, x ex) and 

((x' ey’) oz’, x’ @ (y’ @ z')). Now x? ® x can be unified with x’ e y’ using 
the substitution 6 = (x t/x', x/y']. This leads to the derived pair 

<e o z’,x © (x e@2z’)> which is a critical pair of the rules. 


Using derived pairs, the overlap closure of R, written OC(R), is defined 
inductively as follows: 


a. Every rule r — s in R is also in OC(R). 

b. Whenever r > s and t — wu are in OC(R), every derived pair <p,q> of (r,8) 
and (t,u) is in OC(R) (as p > q). 

c. No other rules are in OC(R). 


"Examples of overlap closures:” 


i. Let R= {f(x) — g(x)}, then OC(R) = R. 

ii. Let R = {f(x) — g(h(x)), h(x) — k(x)}, then OC(R) = RB 
U (f(x) > g(k(x))}. 

iii. Let R = {x @ (y © z) > (x @ y) © z}, then from the superposition 
(x @ (x' @ y’)) © z’ we obtain the rule 


xe ((x' ey’) oz’) > ((x ex’) ey’) oz’ 


and from the superposition (x e ((x’ e y’') ez’) we obtain 


xe (x’ @ (y’ 0 z’)) > (xe (x' e y’)) oz’. 


These rules then lead to further rules, and OC(R) is infinite. 
iv. Let R = {f(x) — g(x), g(h(x)) > £(h(x))}. Then OC(R) consists of R and 
the reflexive rules f(h(x)) > f(h(x)) and g(h(x)) — g(h(x)). 


The overlap closure OC(R) has a rich structure since the overlap closure 
construction preserves some properties of a rewriting system R. The following 
theorem shows that every derived pair of two rewrite rules is also a rewrite 
rule, implying that the overlap closure OC(R) is a rewriting systen. 


2.1. Theorem. If r,s,t,u are terms such that (r,s) and (t,u) are rewrite 
rules, then every derived pair <p,q> of (r,s) and (t,u) is also a rewrite rule. 


Proof. One just has to verify that for each case in the definition of derived 
pair that every variable that occurs in q oocurs also in p. oO 


Let us consider some other properties, based on the properties of its 
rules, of a rewriting system R 


A term is said to be linear if no variable occurs in it more than once. A 
rewrite rule is left-lingar if its left term is linear, gight-linear if its 
right term is linear, and linear if its left and right terms are linear. 


A rewriting systom is called left-linear, right-linear, or linear, based on 
whether each of its rules is left-linear, right-linear, or linear, respectively. 
The following theorem implies that the overlap closure OC(R) of a right-linear 
(left-linear, linear) BR is also right-linear (left-linear, linear). 


2.2. Theorem. If r—s and t—u are two right linear rules with disjoint vari- 
able sets, then each of their derived pairs, <p, q> is also right linear. 


Proof. There are three cases: 
(i) s unifies with the subterm t/i of t by their m.g.u. 9. 


The corresponding derived pair <p, q> has 


p = [0(t) with O(r) at i] 


g = O(u) 


Since s is linear, by Lemma 1 in the Appendix, substitutions for any two dis- 
tinct variables in t/i in © do not have a common variable. The variables in t 
other than the ones in t/i do not play any role. So O(u) is linear. 


(ii) the subterm s/i of s unifies with t by their m.g.u. 9. 


The corresponding derived pair <p, q> has 


p = 0(r) 
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q = [0(s) with O(u) at i] 


Since s/i is linear, by Lemma 1 in the Appendix, substitutions for any two dis- 
tinct variables in t in © do not have a common variable. So, 0(s) and O(u) are 
linear, and q is thus linear. 


(iii) if subterms of s do not unify with t, or s does not unify with subterms of 
t, then there are no derived pairs of rs and tou. O 


By a similar argument, it can also be proved that every derived pair of two left 
linear rules is left linear. 


The name "overlap closure” comes from the fact that the rules of OC(R) are 
a subset of the transitive closure of the rewriting relation of R: 


2.3. Lemma. If p — q is in OC(R) then p % q (using R). 


Proof. By induction on the construction of p — q in OC(R). The basis of the 
induction is the case that p — q is included in OC(R) by virtue of being a rule 
of B. Then obviously p— q holds. If (p — q) is included in OC(R) by being 
a derived pair of (r,s) and (t,u) then by the induction hypothesis for the two 
rules (r,s) and (t,u), we have r — * , and to* u. By the definition of 
derived pair and the transitivity of =, we then have p — is a QO 


2.4. Corollary. If OC(R) contains a reflexive rule, t —> t, then the rowriting 
relation of BR has a cycle. 


Proof. Immediate from the above lemma. 1 


We would like to have the converse of this corollary, that if the rewriting 
relation of BR has a cycle, then OC(R) contains a reflexive rule. This would 
permit searching for cycles by incrementally computing OC(R), looking for a re- 
flexive rule. While we have not been able to prove this in full generality, we 
will present in the next section a restricted version and its proof. The proof 
is not easy, because the overlap closure of B is in general much smaller than 
the full transitive closure of BR. It is this small size, relative to the tran-— 
sitive closure, however, that makes it feasible to use the overlap closure as 
the basis of an approach to proving uniform termination or, at least, a useful 
notion of “restricted termination,” discussed in [Guttag, Kapur, and Musser, 
1981]. 


In order to be able to prove the major result about the overlap closure, we 
need to be able to deal precisely with the various cases of overlap between suc-— 
cessive applications of rewrite rules in a rewrite sequence. We have found it 
useful to introduce a new representation of rewriting that helps to make such 
cases clear. 


The domino representation (or rewrite domino) of a rewrite rule is a rec- 
tangle divided into left and right halves in which are inscribed tree represen- 


tations of the left and right terms of the rule. Function symbols in the terms 
are represented by labelled circles in the trees. Variable symbols are 
represented by labeled rectangles, called "variable boxes.” For examples of some 
rules and their corresponding rewrite dominoes, see Figure 1. 


For each kind of domino (that is, each domino corresponding to a specific 
rule), we assume there is an infinite stock of dominoes of that kind with their 
variable rectangles filled in with all possible terms. For each such domino, we 
also assume an infinite number of copies are available in the stock. 


A sequence of rewrites can be represented by a domino layout, which is a 
two-dimensional arrangement of dominoes that obeys the rules of matching 
corresponding to those of term rewriting (Section 2). Before giving the formal 
definition of a layout, we refer the reader to an example of a rewrite sequence 
using the rules given in Figure 1 and its corresponding domino layout as shown 
in Figure 2. Another example is in Figure 3, and the two layouts in Figures 2 
and 3 could be concatenated to give a single longer layout. 


We draw trees oriented sideways with the root at the left, and we will use 
nested triangles to represent trees schematically. We define a unit layout from 
t to w to be a horizontal arrangement of a tree t, a domino with trees u and v, 


and another tree w, 


_qan< 


in which 


RULE 


1. Sx, 809,27) > af(x.¥),2) 


2. SOS, L0%Y),2) 


3.1 fix,.k(yj>x {| 


4. h(x) ix) 


5. h(x) j(x) 


6. flix), j(x)) > 1 (x) 


Figure 1. A set of rewrite rules and their corresponding rewrite dominoes. 


1 1 2 
Sthla) Stk, gth(a),b))) -»  fth(a),aftk), h(a),b)) > = xtfth(a),ftk,h(aj)).b) — — atf(fth(a),k), h(a), b) 


Figure 2. A rewrite domino layout and the|! corresponding rewriting sequence (using dominoes of 
Figure 1). - : 


3,5 4 6 
Btf(fth(a),k),h(a)),b) > gtfth(a),j(a)),b) > glflila)j(a)),b) > ef i(a),b) 
Figure 3. Another layout (a continuation of the Inyout in Figure 2). 


1. at some position, i, in t there is a subtree t’ that is identical to u, ig- 
noring the variable boxes that appear in u; 

2. the roots of t’ and u are horizontally aligned; 

3. wis the tree [t with v at i] and the roots of t and w are horizontally 
aligned. 


A layout from t to vy is defined as 


— 


1. a unit layout from t to v; or 

2. the concatenation of a layout from t to u with a layout from u to v, with 
both copies of u dropped from the arrangement; or 

3. any arrangement obtained from a layout by translating horizontally any dom— 
ino, as long as no other domino or end tree is overlaid or crossed (this 
allows compaction of a layout by placing one domino above another when they 
match disjoint subterms). 


The examples in Figures 2 and 3 illustrate a number of observations we can 
make about this representation of rewriting: 


1. In a domino layout there is no distinction between different orders of 
rewriting when the rules are being applied to disjoint subterms; e.g., the 
layout in Figure 3 would not be different if rule 5 had been applied before 
rule 4 or before rule 3. One can think of these rules being applied in 
parallel, since the order of application is always immaterial in this case. 
The layout representation just makes this property especially evident. 

2. To the property that “the rightmost term of a rewrite sequence is terminal” 
corresponds the property that “there is no way to play a domino on the lay— 
out” (formally, there is no way to concatenate a unit layout onto the lay- 
out). The layout is said to be blocked. (The layout in Figure 3 is 
blocked.) 

3. Thus the rules have the uniform termination property if and only if every 
possible layout eventually is blocked. Equivalently, there are no infinite 
layouts. 


Our purpose with this representation of rewriting is to provide a conceptu— 
al tool for finding and presenting proofs of new results about term rewriting 
systems. The first result we will prove with the aid of rewrite dominoes is one 
that will allow us to speed up the search for cycles by considering only those 
sequences of rewrites in which a “major rewrite” occurs. 


A rewrite to ca ty is called a major rewrite if it is by application of a 
rule, t —> u, to the entire term t); i.e., for some substitution 0, O(t) = to 
and © (u) = t,- When only a proper subterm of t, is matched, t, —> t, is called 


a minor rewrite. 


In a layout, a domino is called a major domino (of the layout) if it 
represents a major rewrite, and a minor domino otherwise. Pictorially, major 
dominoes are those that span the width of the layout. 


A major cycle is a cycle in which at least one of the rewrites is major. 


3.1. Theorem. If a rewriting relation has a cycle, it has a major cycle. 


Proof. Let us define the corridor of a domino in a layout to be the horizontal 
strip across the layout determined by the position and width of the domino: 


Any two corridors in a layout are either disjoint or one is contained in 
the other. Therefore, we can find a corridor that is spanned by a domino and 
which contains a layout as follows: start with any leftmost domino and follow 
its corridor to the right; whenever a domino is encountered that doesn’t lie in 
the corridor, adopt its corridor. When we reach the right end, we have a corri- 
dor containing a layout including a domino that is major with respect to it. If 
the whole layout is cyclic, the identified layout will be also, and will 
represent a major cycle, O 


We now want to define some terminology and some manipulations of layouts that 
will be useful in proving theorems about the overlap closure of a set of rules, 
Consider an adjacent pair of dominoes in a layout. Let t and u be the trees on 
the adjacent halves, where a subtree t’ of t is identical to u (possibly 
t’' = t): 


a 


If either of t’ or u is contained entirely within a variable box, i.e., the 
match is not between two nonvariable subterms, we say that the pair of dominoes 
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is weakly matched, and otherwise that it is strongly matched. 


Examples. In Figure 3, the domino pair 


that appears in the concatenation of the layouts of Figures 2 and 3 is weakly 
matched, while all the other adjacent pairs are strongly matched. 


Now suppose we have two weakly matched dominoes, as in Figure 4a, where t’' 
is contained in the x variable box. If the (s,t) domino is right-linear (i.e., 
t is linear), then the pair of dominoes can be transposed as follows: remove the 
(u,v) domino from the layout and move the (s,t) domino to the right, so that 
copies of the (u,v) domino can be inserted to the left of the (s,t) domino, one 
adjacent to each x box in s (see Figure 4b). Then the resulting configuration 
is still a Jsyout, (the dominoes all match, using the same set of rules) with 
the same end trees. This is the case also when a symmetric kind of transposi- — 
tion is performed on the layout in Figure 5a, producing the layout in Figure 5b, 
where we assume that the (u,v) domino is left—linear. 


Such transpositions cannot necessarily be performed on strongly matched 
dominoes, but we will define a different kind of manipulation for this case. 
Strong matching corresponds to the concept of overlapping in the definition of 
derived pairs: if (r,s) and (t,u) are rules that have a derived pair <p,q), 
then the dominoes corresponding to (r,s) and (t,u) can be placed in a layout so 
that they are strongly matched. The layout configuration shows just where the 
strong match ocours and identifies a potential derived pair. 


Figure 4. 


Transposition of weakly matched dominoes, where left 


Transposition of weakly matched dominoes, where right 
domino is right-linear. 


domino is left-linear. 


es 
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Suppose now that instead of our stock of dominoes corresponding to a given 
rule set R, we have a stock corresponding to OC(R), the overlap closure of R. 
Then for any strongly matched pair of dominoes in a layout there is a domino in 
our stock which corresponds to a derived pair generated by the matching pair. 
By Lemma 3 proved in the Appendix, we can replace the strongly matched pair in 
the layout by the "derived pair domino” thus identified, and the result will 
still be a layout with the same end trees. 


We are now in a position to prove: 


3.2. Theorem. Suppose the rewriting relation of R is globally finite and every 
rule in R is right-linear. If the rewriting relation of R has a cycle, OC(R) 
contains a reflexive rule. 


Proof. (By construction.) Let 


(*) tp Pt, Peer Dt 7% ¢t 


0 1 0 


be a given cycle. Corresponding to (*) is a cyclic domino layout 


(**) Cap ne 


where the dominoes correspond to rules of R. In fact since each of these rules 
is also in OC(R), we may take this layout as a layout of dominoes corresponding 
to rules of OC(R). We will show how to manipulate this layout to a form that 
shows there is a reflexive rule t — t in OC(R). 

We describe the manipulations as an algorithm operating on the cyclic lay- 
out (**), 


Step 1. [Extract major cycle.] As in the proof of Theorem 5.1, extract from 
(**) a sublayout representing a major cycle, making it the layout subject to the 
following steps. Also replace to with its subterm matched by the layout. 


Step 2. [Push major dominos to right end.] Manipulate the layout to a form in 
which all of the major dominoes are together at the right end, by means of tran— 
spositions or replacements by derived pair dominoes: whenever D is a major domi-— 
no and E is a minor domino adjacent to PD on the right 
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either D and E are weakly matched, in which case they can be transposed, or they 
are strongly matched, in which case they can be replaced by the derived pair 
domino they define - which is a major domino. This derived pair domino is also 
right linear, as Lemma 1 in the Appendix shows. 


Step 3. [Look for cycle among major dominoes.] There is now a nonempty sequence 
of major dominoes D, ,..., Dd. at the right end of the layout: 


aa) --- P< 


1 


These dominoes can only be strongly matched - except for the case where the 
right-hand side of D, is just a variable, but shortly we will show that such a 
possibility can be ruled out. If there is some contiguous subsequence 


D; peeve D that forms a cyclic layout 


QP}: PI 


then, since there can only be strong matches, these dominoes can be combined by 
j - i+ 1 replacements into a single domino D that forms a cyclic layout: 


Qe 


Let D represent (p,q). Then there is a substitution © such that Uy = O(p) and 
@(q) = Ups i.e., © unifies p and q. Furthermore, a derived pair of (p,q) and 
(p,q) is the reflexive rule (O@(p), O0(q)). Since this is in OC(R), we terminate 
the algorithm. 


Step 4. [Duplicate.] If no such subsequence exists, construct a copy of the 
layout adjacent to it and return to Step 2 with the resulting layout: 


929) F283. 


That concludes the statement of the algorithm. Before considering the question 
of termination of the algorithm, we dispense with the detail mentioned in 
Step 3: the case of adjacent major dominoes D and E where the right term u of D 
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is just a variable. We can assume the left term t of D is not just a variable 
(if it were then it would have to be the same variable as u and we would already 
have a reflexive rule). Since the layout is cyclic, if we drop D from the lay- 
out, we obtain a layout that has as its right end term a proper. subterm identi-— 
cal to the left end term. From this we conclude that.the term. sewriting zela~ 
tion’ is not globally finite, contrary to assumption. This contradiction rules 
out the case under discussion. 


It is obvious that each step of this algorithm is effective and terminat— 
ing. Overall termination is guaranteed by the following facts: 


a. At the k th execution of Step 2, the number of major dominoes, m, at the 
right end is at least 2*. 


b. Let t,x] denote the term to the left of dD. in the layout at the k th exe- 
cution of Step 3. Since each ty [kl] is derived from t,. and the rewriting 
relation is slobally finite, there are only finitely many distinct possi- 
bilities for ty [k]. By a), then, there is one such. term for which arbi~ 
trarily long layouts of major dominoes exist. Again by global finiteness, 
these layonts cannot all continue “without. producing a term, Ups that is a 
duplicate of some term previously obtained in the layout. 


Since the algorithm always terminates, and does so with a reflexive rule in 
OC(R), this proves the theorem. [D 


The corresponding theorem obtained by replacing "“right-linear” by "left-— 
linear” can also be proved in a similar manner. Combining these theorems with 
Corollary 4.3, we have: 


3.3. Theorem. Suppose the rewriting relation of R is globally finite and every 
rule in R is right-linesr or every rule in BR is teft-lisear. Then the rewriting 
relation of R is uniformly terminating if and only if OC(R) contains no reflex- 
ive rule, 


Some applications of this theorem are explored in [Guttag, Kapur, and Musser, 
1981] 


Recently, Dershowitz (1981) has propossed a "forward chain” construction for 
rewriting systems and proved that a right-linear rewriting system is uniformly 
terminating if and only if it has no infinite forward chains. However, for 
left-linear systems the analogous result requires that the left-hand sides of 
the rules be nonoverlapping, a problem that we had independently encountered 
when considering the forward chain construction and a similar backward chain 
construction. We were thus led to invent the overlap closure construction. The 
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following example from Dershowitz (1981) illustrates the advantage of the over- 
lap closure construction over forward chains. Using the forward chain construc- 
tion, it is not possible to determine the nontermination of this left-linear 
rewrite system, as pointed out by Dershowitz. The rewriting system is 


f(a), b(), x- f(x, xX, b()) 
bM -> al). 


These rules have only two forward chains, both finite: 


f(a(), b(),x) > f(x, x, B()) > f(x, x, af)), and b( ) > al), 


but we cannot conclude anything about the termination of the rules because they 
are not right-linear and, although they are left-linear, the left-hand sides are 
overlapping. But in the overlap closure construction, the rules have a derived 
pair rule 


re 


f(b(), b(), x) > f(x, x, b()), 


which, when overlapped with itself, gives the reflexive rule 


f£(b(), B01), BO) > £100), BC), bB) )), 


as a derived pair, proving that the rules are nonterminating. 
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4. CONCLUSION 


We have discussed two ways to make use of finite subsets of the overlap 
closure: proving restricted termination and disproving uniform termination. We 
have explored, without much success, using such finite subsets as parts of 
proofs of uniform termination. We conjecture that for certain classes of term 
rewriting systems it should be possible to compute a bound, n, such that if a 
cycle exists, there exists a cycle in which every term is of size n or less. 

For such classes, the overlap closure would provide a decision procedure for un-— 
iform termination. 


Another open question about the generality of the overlap closure construc— 
tion is whether the assumption of left-linearity or right-linearity is neces— 
sary. Although we have not been able to find proofs of our results without this 
assumption, we have also been unable to construct a counterexample. In any 
case, as discussed above, the overlap closure construction is more general than 
either forward or backward chain constructions. 


For the class of term rewriting systems to which it may be applied, con- 
structing the overlap closure is as useful as constructing the complete transi- 
tive closure. Furthermore, using the overlap closure to show restricted termi- 
mation or the absence of uniform termination will always involve computing fewer 
terms than would using the transitive closure. We do not yet have much empiri- 
cal or analytical evidence as to the absolute efficiency of using the overlap 
closure for these purposes. The key question is how many terms must be examined 
in order to demonstrate that no cycle is possible for terms of up to size n. 

The fow examples we have tried, using a preliminary implementation, we have 
found encouraging. 


ACKNOWLEDGMENTS 


The basic idea of conducting a search for repeated terms (cycles) or sub- 
terms sprang from discussions in 1977 between one of the authors (Musser) and 
Dallas Lankford. We thank P. Gloess, G. Huet, and J. Levy for their interest 
and assistance in refuting some of our earlier conjectures, thus helping us ar- 
rive at the notion of the overlap closure and the theorems of Section 3. We 
also thank P. Narendran for assistance in constructing the proof of the theorems 
in the Appendix and J. Goguen for discussions of the approach to term rewriting 
used in OBJ. John Guttag’s research is supported in part by the National Science 
Foundation under grant MCS78-01798 and by an Office of Naval Research Contract 
with DARPA funding #N00014-75-C-0661. 


- 16 - 


REFERENCES 
N. Dershowitz, '’Termination of Linear Rewriting Systems - Preliminary Ver-— 
sion,’' in Automata, Languages, and Programming, Eighth Colloquium, Israel (Eds: 


S. Even and 0. Kariv), LNCS 115, Springer Verlag, New York, 1981. 


J.A. Goguen and J. Tardo, '’An Introduction to OBJ-T,''’ Proc. of Conf. on 
Specification of Reliable Software (1979). 


J.V. Guttag, E. Horowitz, and D.R. Musser, ''Abstract Data Types and Software 
Validation,’’ Comm. A.C.M. 21 (1978), 1048-1064. 


J.V. Guttag, D. Kapur, and D.R. Musser, On Proying Uniform Termination and Res-— 
tricted Termination of Rewriting Systems, Report No. 81CRD272, G.E. C. Research 
and Development, Schenectady, New York, Nov. 1981. 


G. Huet, '’Confluent Reductions: Abstract Properties and Applications to Term 
Rewriting Systoms,'’ J. ACM 27 (1980), 797-821. 


G. Huet, D.S. Lankford, ''On the Uniform Halting Problem for Term Rewriting Sys— 
tems,'’ Rapport Laboria 283, IRIA, Mars 1978. 


G. Huet, D.C. Oppen, '’Equations and Rewrite Rules: a Survey,’’ Formal Languages 
Perspectives and Open Problems (BR. Book, ed.), Academic Press (1980). 


D.E. Knuth, P. Bendix, '’Simple Word Problems in Universal Algebra.’’ Computa— 


tional Problems in Abstract Algebra (J. Leech, ed.), Pergamon Press (1970), 
263-297. 


D. Musser, ‘'Abstract Data Type Specification in the AFFIRM system.’’ JEEE Tran- 
sactions on Software Engineering 6 (1980), 24-31. 


G.E. Peterson and M.E. Stickel, Complete Sets of Reductions for Equational 


Theories With Complete Unification Algorithms, Tech. Report, Dept. of Computer 
Science, U. of Arizona, Tueson, Sept. 1977. 


-17 - 


APPENDIX 


1. Lemma. Let t and u be unifiable terms with disjoint variable sets, and 0 be 


their most general unifier. Let @* be the restriction of © to the variables of 
u, say Of = [e,/v,.-.-.0 /vi). If t is linear, then all variables in Op reeere, 


are distinct. 


Proof. For every variable x having k (1) occurrences in u, replace different 


occurrences of x by distinct variables x that do not appear in t and uu. 


q7cccohy 
Let u’ be the resulting term, which is linear. 


By Lemma 2, in the m.g.u. 0’ of t and u’, substitutions for distinct variables 
in t and vu’ do not have a common variable. Let . be the m.g.u. for the set of 
terms @'(x,), 1<i<k, the substitutions for the variables used to replace multi- 
ple occurrences of x in u. If these o, for every variable x having multiple oc- 


currences in u are composed with 0’, we get a unifier of t and u. 


In this unifier, substitutions for variables in u do not have a common variable. 
From this, it is evident that the m.g.u. 8 of t and u cannot have substitutions 


for variables in u that share common variables. O 


2. Lemma. For two unifiable terms t and u, if t and u are linear, then the sub- 
stitutions in their m.g.u. © for any two distinct variables of t or u do not 


have common variables. 
Proof. By induction on the structure in term t. 


Basis: t is a variable. 


Then 0(t) = u and the statement trivially holds. 


Inductive step: t = £(t,,-.-.t,) 
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For t and u to be unifiable, either u is a variable or u == f(u,,..-50,)- The 


case of u being a variable is handled as in the basis step. 

For the case u == f(u,,....0,), for each i, 1¢i<n, t; must unify with us by 
their m.g.u. 6;> say. By the inductive hypothesis, the statement holds for each 
of 6;- Since t and u are linear, the disjoint union of ®.- 1<i<n, is the m.g.u. 
@ of t and u. It follows that the statement of the lemma holds for @ also. O 


3. Lemma. Suppose to +> t, using r — s applied at position i, ty - ty using 


1 
t + a applied at i.j, and s/j and t overlap determining the derived pair 


<p,q> = <O(r),[@(s) with O(u)at j]>. Then ty ~ t, using p — q applied at i. 


A similar result holds for the case in which s unifies with a subterm of t. 
Proof. Rename the variables of t and tu, if necessary, so that s and t have no 
variable in common. There is some subterm to/i and a substitution 8, such that 


@, (x) = to/i and t, = [t, with @, (3) at i]. 


1 


Again, there is some subterm t,/G.3) and a substitution 8, such that 


@, (t) = t,/C.j) and t, = [t, with 8, (a) at i.jl]. 
Since the variables of s and t are disjoint, we have (@, U @,) (8/4) = @, (8/4) = 
@, (t) = (0, U 6,)(t). That is, 8, U 8, is a unifier of s/j and t and therefore 


has © as a factor: 


8, U 8, = 8, e 8, for some substitution 8,- 


Thus to/i = 6, (x) = (8, U @,) (x) = (6, e 8) (r) = @, (@(r)) = 0, (Pp). That is, ty 
is matched by p at i. Now consider @3(q); it is 


8 


A ([O(s) with O@(u) at j]) 


= fe, (@(s)) with @,(@(u)) at jl 
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= [6, (s) with ®, (a) at jl. 


Thus t 


[t, with ®, (a) at i.jl 


[ft, with @, (s) at i] with ®, (u) at i.j] 


if 


{t, with [6, (s) with @, (u) at j] at i] 


0 
= [ty with ®,(q) at i], showing that 
ty >t, using p—q applied at i. We omit the proof of the case in which s uni- 


fies with a subterm of t. 
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